A Segmentation-free Approach to Recognise Printed Sinhala Script
نویسنده
چکیده
Majority of character recognition algorithms such as the use of ANNs needs segmentation of the script prior to recognition. Contrast to Western scripts, Brahmi descended South Asian scripts such as Sinhala consist of modifier symbols, which make the segmentation a difficult task that needs to be addressed as a separate issue. Further, the change of shape of the basic character (by violating modification rules) in the modification process makes some modified Sinhala characters impossible to segment. The proposed method, which uses Linear Symmetry to examine a co-relation between characters in the script with the testing alphabet, recognises characters directly within the image of the script. A similar method is used to resolve confusing characters. Experiments show highly favourable results not only for the basic characters of the alphabet but also for the modifier symbols. A novel but simple method using Linear Symmetry for skew correction has also been proposed.
منابع مشابه
A segmentation-free approach to recognise printed Sinhala script using linear symmetry
In this paper, a novel approach for printed character recognition using linear symmetry is proposed. When the conventional character recognition methods such as the arti1cial neural network based techniques are used to recognise Brahmi Sinhala script, segmentation of modi1ed characters into modi1er symbols and basic characters is a necessity but a complex issue. The large size of the character ...
متن کاملLexicon and hidden Markov model-based optimisation of the recognised Sinhala script
The Brahmi descended Sinhala script is used by 75% of the 18 million population in Sri Lanka. To the best of our knowledge, none of the Brahmi descended scripts used by hundreds of millions of people in South Asia, possess commercial OCR products. In the process of implementation of an OCR system for the printed Sinhala script which is easily adoptable to similar scripts [Premaratne, L., Assabi...
متن کاملRecognition of Printed Sinhala Characters Using Linear Symmetry
Sinhala characters used in the Sinhala script by over 70% of the 18 million population in Sri Lanka, have been descended from the ancient Brahmi script. The Sinhala alphabet consists of vowels and consonants and the consonants are modified using modifier symbols to give the required vocal sounds. In the process of developing an OCR for the Sinhala script, characters are initially recognised thr...
متن کاملRecognition of Modification-based Scripts Using Direction Tensors
The research on the OCR technology for the Latinbased scripts has been successful in achieving the status of image scanners with built-in OCR facility. But, a majority of modification-based scripts such as Brahmi descended South Asian or Ethiopic scripts are still progressing to achieve this status. This indicates the difficulties in adopting the recognition methods that have been proposed so f...
متن کاملA Neural Network Based Character Recognition System for Sinhala Script
Much effort has been extended in making a computer recognise both typed and handwritten characters automatically. Until quite recently, the focus of this endeavour has been on characters of English Language. As for Asian languages such as Sinhala and Tamil, little or no attention has been given. Methods currently widely used for character recognition for these languages are mainly those which i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004